Binarization of Gray-Scaled Digital Images Via Fuzzy Reasoning 


Jesus A. Dominguez, Steve Klinko 
Dynacs, Inc. at NASA Kennedy Space Center 


Abstract - A new fast-computational technique based on fuzzy entropy measure has 
been developed to find an optimal binary image threshold. In this method, the image 
pixel membership functions are dependent on the threshold value and reflect the 
distribution of pixel values in two classes ; thus, this technique minimizes the 
classification error. This new method is compared with two of the best-known 
threshold selection techniques, Otsu and Huang-Wang. The performance of the 
proposed method supersedes the performance of Huang-Wang and Otsu methods when 
the image consists of textured background and poor printing quality. The three 
methods perform well but yield different binarization approaches if the background 
and foreground of the image have well-separated gray-level ranges. 

1. Introduction 

Binary image processing is of special interest because an image with binary format can 
be processed with very fast logical (Boolean) operators. Since most image display 
systems and software assume images of 8 or more bits per pixel, the binarization of these 
images usually takes 2 extreme gray tones, black and white, which are ordinarily 
represented by 0 or 255, respectively, in an 8-bit grayscale display environment. 

Usually a binary image is obtained from an 8-bit gray-level image by thresholding and 
assigning the low binary (0) and high (255) values to all gray levels based on the chosen 
threshold. Obviously, the threshold that is chosen has a critical importance since it 
controls the binary-based pattern classification that is obtained from the gray-level 
image. The key issue is to choose an optimal threshold so that the number of misclassified 
image pixels is kept as low as possible. Image thresholding is the simplest image 
segmentation approach. It is actually a pattern classification procedure in which only 
one input feature is involved, the pixel intensity value. The key issue here is to choose an 
optimal threshold value so that the number of misclassified image pixels is kept as low as 
possible. 

Fuzzy logic is a powerful tool to solve many image processing problems because of its 
ability to deal with ambiguous data. Selection of a threshold to binarize an image is not 
straightforward because of ambiguity or fuzziness caused by the overlapping of two 
class densities. Several fuzzy-model-based methods have been developed in the past and 
recently to overcome the difficulties. For example, an optimal threshold value can be 
determined based on Pal and Rosenf eld’s fuzzy compactness measure and Pal and 
Ghosh’s index of area coverage measure [1], [2J. In these two methods, for each 
possible threshold-value in. the grayscale, range (0 to 255 range for an 8-bit grayscale 
image), the membership value of every pixel is needed to compute the compactness or 
index of area coverage measure. When the image is large in size, these methods can 



require a long computing time to search for the optimal threshold value. Huang and 
Wang recently developed a fuzzy thresholding algorithm that uses the image pixel value 
histogram but does not need to deal with each individual pixel [3]. This method can be 
very efficiently implemented and requires much less computing time than the above 
fuzzy-based methods. We have used Huang -Wang method as one of the image 
preprocessing operators within the implementation of a Visual Anomaly Detection 
Prototype System built for NASA at the Kennedy Space Center [4]. Another widely used 
and well-known digital image binarization method is Otsu’s method, which is based on 
discriminant analysis to maximize some measures of class separability [5]. 

Similar to the Huang-Wang method, the proposed fuzzy reasoning image binarization 
approach uses the entropy measure as the criterion for selection of the optimal image 
threshold. Experimental results using real images via a PC with 1.8-GHz 1-GB 
capabilities showed that the proposed fuzzy thresholding method performed more reliably 
than Huang-Wang’ s and Otsu’s methods for different types of images. Central 
processing unit (CPU) execution time is also a very important factor when using a given 
image binarization method in real-time systems; the proposed method, in general, 
requires similar CPU execution time to Otsu’s method and much less execution time than 
Huang-Wang’ s method. 

2. Implementation 

In the proposed method, described in appendix A, we consider an image as an array of 
singletons corresponding to image pixels (see equation 1, appendix A), each having a 
membership value associated with a property of the pixel, the grayness level. For image 
thresholding, the membership function is defined in terms of the grade of pixel's grayness 
belonging to one of the two classes, background and foreground, as shown in appendix A, 
figure 1. The membership function is calculated via a triangular function at each of the 
two classes (see appendix A, equation 4); the triangular function in each of the two 
classes is built using the average image grayness and histogram-based weight values at 
each class as shown in appendix A, equations 2 and 3. In each of the two classes, the 
membership value ( equal to 1) is the largest at the class average gray level and reduces 
its value when the difference between the pixel gray level and its class average level 
increases. This means that pixels with gray levels close to their corresponding class 
average gray levels have less fuzziness or ambiguity and thus can be classified with 
greater confidence than pixels with gray levels far from their class gray levels. The 
image entropy measure is used as a cost function to find the optimal threshold (equation 
6). It is defined using the histogram information as shown in appendix A, equation 5. 
The entropy factor needed to compute the- entropy measure is calculated using a simple 
and fast computational linear function as shown in appendix A, figure 2. 

The proposed method uses a similar (but more 'efficient and faster computational 
approach than ’ the, one ‘used in Huang-Wang method. Huang-Wang method uses a 
symmetric membership ■ function, while the proposed approach uses a more realistic 
membership function having the highest ' and lowest gray levels holding nonzero 
histogram values as the domain limits. Huang-Wang method limits the membership 
values within a range of 0.5 to 1.0 since it uses Shanon's entropy function as a cost 


function and the cost should decrease as the membership function value increases (as the 
fuzziness becomes smaller); Huang-Wang method allows the use of Shannon function in 
the range of 0.5 to 1.0 since the Shannon function values decrease only within this range 
as the membership value increases. This is why Huang-Wang method imposes this 
restriction to the membership function. In contrast, the new proposed method does not 
restrict the range of membership values and uses a straightforward triangular-type 
membership function. The proposed approach also uses a straight-line cost function that 
requires much less computational power than the Shannon function used by Huang-Wang 
method. 

3. Examples 

The performance of the proposed thresholding method was compared to Huang-Wang 
and Otsu methods, two of the most reliable image threshold methods, using 4 different 8- 
bit grayscale images as shown in figures 1 and 2. The binary threshold results for each 
image using Otsu, Huang-Wang, and the proposed method are shown in figures 3, 4, 5, 
and 6. Figure 7 summarizes the CPU execution time required by each threshold method 
to find the optimal threshold and generate the respective binary image. For images with 
textured background and poor printing quality ( images 1 and 4), the proposed method 
has a consistently better overall binarization performance than Huang-Wang and Otsu 
methods, as shown in figures 3 and 6; the CPU execution time for the proposed method is 
in general much lower than the CPU time required by Huang-Wang method and almost 
the same as the CPU execution time required by Otsu’s method, as illustrated in figure 7 . 
In images 2 and 3 (figures 4 and 5, respectively) the Huang-Wang and the proposed 
methods yielded similar binarization performance, but the proposed method requires less 
CPU execution time. In image 3 (figure 5), the Otsu method yielded a different 
binarization performance than the other two methods (Wang-Huang and the proposed 
one), requiring similar CPU execution time to the proposed method. While Otsu’s 
method successfully extracted the general borders of image 3, the other two methods 
(Huang-Wang and the proposed one) extracted a relevant part of the image (the numbers 
printed on the compact disk). 

In general, the proposed and Otsu methods require comparable CPU execution time, 
while the Huang-Wang method requires much more CPU execution time, as shown in 
figure 7. The performance of the proposed method supersedes the performance of 
Huang-Wang and Otsu methods when the image consists of textured background and 
poor printing quality. The three methods perform well but yield different binarization 
approaches if the background and foreground of the image have well-separated gray- 
level ranges. 

4. Conclusions 

The proposed thresholding method supersedes Huang-Wang and Otsu methods, the most 
reliable and fastest thresholding methods, when the image contains textured background 
and poor printing quality (see figures 3 and 6). This would provide a much simpler, 
faster, and more reliable binary segmentation tool for Optical Character Recognition 
(OCR) and Optical Character Validation (OCV) applications and texture analysis. This 
method would also provide fast and reliable segmentation capability to extract certain 



areas of interest (see figures 4 and 5), reducing considerably the CPU execution time 
required to extract them using existing gray-scale-based approaches. Since the proposed 
method requires relatively low computational time compared with Huang-Wang and Otsu 
methods, it might be used as a binarization operator within real-time image analysis 
systems. 
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Original 8-Bit Gray Scale: Image 1_ 
Imape Size: 246x245 



Huang-Wang Binarization Method 

CPU Execution Time: 10.8 ms 
Optima! Threshold: 89 



Otsu’s Binarization Method 

CPU Execution Time: 1.5 ms 
Optimal Threshold: 88 
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Figure 3. 



Original 8-Bit Gray. Scales Image 2 
Image Size: 434x667 



Huana-Wana Binanzatign Method 

CPU Execution Time: 14.3 ms 
Optimal Threshold: 146 



Otsu’s Bjnarization Method 

CPU Execution Time: 2.3 ms 
Optima I Threshold: 117 



Proposed Binarization Method 

CPU Execution Time: 3.5 ms 
Optimal Threshold: 162 


Figure 4. 



Original SSjt Gray. Scales Image 3 
Image Size: 54 1 x4 73 


Otsu’s Binarization Method 

CPU Execution Time: 3.9 ms 



Huang-Wang Binarization Method 

CPU Execution Time: 16.1ms 
Optimal Threshold: 191 


Proposed Binarization Method 

CPU Execution Time: 4.0 ms 
Optimal Threshold: 1 78 


Figure 5. 
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Otsu’s Binarization Method 


Image Size: 434x607 


CPU Execution Time: 5.2 ms 
Optimal Threshold: 89 
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Huana-Wana Method 


CPU Execution Time: 15.4 ms 


Findings of ancient martian microbiai fossils in 
meteorites and liquid water related features on 
Mars’ surtace are currently controversial issues. 
But one thing long established by space based 
observations cf the Reci Planet is the presence of 
volcanos, as Mars supports some of the largest 
volcanos in the solar system. This synthetic 
color picture recorded in March by the Mars 
Global Surveyor spacecraft shows two of litem, 
Ceraunius Tholus (leftmost) and U rani us Tholes, 
Found north of the Tharsis region of. truly large 
martian . volcanos, these are. actually two 
relatively small volcanos. Ceraunius Thclus 
being only about the size of the Big Island of 
Hawaii on planet trarih. Impact' craters which 
overlay the volcanic .martian terrain indicate that 
these . volcanos are . themselves ancient and 
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Optimal Threshold: 60 


Optimal Threshold: 22 


Figure 6. 
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Optimal Digital Image Threshold Performance 
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Figure 7. 



APPENDIX A: MATHEMATICAL FORMULATION 


The image is defined as an array of fuzzy singletons corresponding to image pixels, each 
having a membership value associated with a certain property of the pixel. Under this 
assumption, an image I can be represented as 

I = \f{x,y),n,{f{x,y))\ (i) 

For image thresholding, the membership function /Ji(f(x,y)) can be defined in terms of the 
grade of pixel (x,y) belonging to one of the two classes, background and foreground. The 
membership function in each of these two classes is built based on the average gray level 
of each class, which is computed using the gray-level histogram as average weight factor 
as shown in equation 2. 


G 1 (7)=£kfffe)]/£//(z) 

MinZ MinZ 

(2) 

MaxZ MaxZ 

G 2 {T)=^[zH(z)]/J J H {z) 

(3) 


r+i r+i 


Domain: 0 < MaxZ , MinZ, T, and z <L- 1 


Where 

T 

z 

MinZ 

MaxZ 

L 

H(z) 

Gi(T) 

<h(T) 


Threshold value 
Gray level 

Lowest gray level holding a nonzero histogram value 
Highest gray level holding a nonzero histogram value 
Total gray-level values; for an 8-bit image, L = 2 8 = 256 
Image histogram value of gray level z 
Average gray-level value for class 1 (background) 
Average gray-level value for class 2 (foreground) 


The membership function is linear-triangular-type as shown below and defined as 

f [z - MinZ\[G x (T)- MinZ ] if MinZ < z < G, (r) ^ 

[t -z\\T- G, (r)] if G, (t) < Z < T 

[z - t\[g 2 (r)- T ] if T < Z < g 2 (t) 

[MaxZ - z\\_MaxZ - G 2 (T )] if G 2 {T)<z<MaxZ 


Mibh 


(4) 




Figure 1 . Membership Function 


The entropy measure is used as a cost function for the selection of the optimal image 
threshold. It is defined using the histogram information as 

MaiZ 

S^)=^/\MNlog2]^H(z)Se\M I (z)] (5) 

MmZ 

Where 

S(T) = Entropy measure 

M = Image rows (number of horizontal pixels) 

N = Image columns (number of vertical pixels) 

Se[/j/(z) ] = Entropy factor 

A simple linear function with negative slope is selected to calculate the entropy factor 
since the entropy measure should decrease as the membership value increases (as the 
fuzziness becomes smaller). The linear function is shown in figure 2. 

The threshold is chosen to minimize S(T), that is 

T optimal = ar 8 win S(t) where MinZ < T < MaxZ (6) 




